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I consider deterministic distinguishability of a set of orthogonal, bipartite states when only a single 
copy is available and the parties are restricted to local operations and classical communication, 
but with the additional requirement that entanglement must be preserved in the process. Several 
general theorems aimed at characterizing sets of states with which the parties can succeed in such 
a task are proven. These include (1) a maximum for the number of states when the Schmidt rank 
of every outcome must be at least a given minimum; (2) an upper bound (equal to the dimension of 
Hilbert space if entanglement need not be preserved) for the sum over Schmidt ranks of the initial 
states when only one-way classical communication is allowed; and (3) separately, a necessary and a 
sufficient condition on the states such that their original Schmidt ranks can always be preserved. Two 
additional theorems explicitly demonstrate a tradeoff between the extent to which the set of states 
fill Hilbert space, as measured by their Schmidt ranks, and how refined the parties must make their 
measurements, an important factor in determining the Schmidt rank the state can retain after it has 
been identified. It is shown that our bound on the sum of Schmidt ranks can be exceeded if two-way 
communication is permitted, and this includes the case that entanglement need not be preserved, 
so that this sum can exceed the dimension of Hilbert space. Such questions, concerning how the 
various results are effected by the resources used by the parties (amount of classical communication 
and types of local operations) , are addressed for each theorem. This subject is closely related to the 
problem of locally purifying an entangled state from a mixed state, which is of direct relevance to 
teleportation and dense coding using a mixed-state resource. In an appendix, I give an extremely 
simple and transparent proof of "non-locality without entanglement", a phenomenon originally 
discussed by Bennett and co-workers several years ago. 

PACS numbers: 03.67.Mn, 03.67.Hk 



I. INTRODUCTION 

Many of the most interesting problems in quantum in- 
formation involve two physically separated parties each 
acting on their part of a shared, entangled (bipartite) 
state. Due to their separation, it may not be practi- 
cable to bring the parts together to perform global op- 
erations on the entire system, though the parties may 
have the means by which to communicate classically with 
each other, perhaps to share information about outcomes 
of measurements. Thus, the parties may perform local 
operations and classical communication, LOCC as it is 
widely known in the literature. It is a reasonable ex- 
pectation that the separated parties are generally able 
to accomplish less with LOCC than they could if they 
brought the parts of the entangled system together. One 
example adhering to such an intuition is that only if the 
parts are brought together is it possible to increase the 
entanglement of the system on average [ij] . 

As is often the case in discussions of entangled systems, 
however, there have been surprises in store for us. One 
such surprise was provided by Walgate and co-workers 
who showed that it is not necessary to use global 
operations in order to distinguish between two orthogo- 
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nal, multipartite pure states. That is, if the parties are 
given a system that is in one of two possible orthogo- 
nal states, they can with certainty determine which state 
their shared system is in by means of LOCC alone. If the 
two states to be distinguished are product states, with 
an absence of quantum correlations between the parts, 
there is of course no reason to have expected otherwise. 
However, when the parts are correlated through quan- 
tum entanglement, one might have expected a need for 
global operations to learn about the differing correlations 
present in the two states to be distinguished. 

To accomplish the task of distinguishing, it was shown 
in [2I] that for any two orthogonal states, the parties sim- 
ply need perform standard, projective measurements on 
their separate parts, with one party's measurement con- 
ditioned on the other party's outcome, communicated to 
the former by means of a classical channel. By stan- 
dard, here, I mean that the measurements involve pro- 
jections onto one-dimensional subspaces (pure states) of 
the Hilbert space describing states of each local system. 
Thus, although the parties have succeeded in determin- 
ing the state they were given, the state they now share 
is, with certainty, a product state. Hence, this means of 
determining the state leads, necessarily, to a concomitant 
destruction of entanglement. 

It is by now well understood that entanglement is 
an important resource, examples provided by its use in 
quantum communication protocols such as teleportation 
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and dense coding Therefore, along with a desire 
by the two separated parties to discover which of the 
two (or more) states they share, they may also wish to 
preserve at least a portion of the entanglement inherent 
in those states. For example, suppose that Alice wishes 
to use teleportation to convey to Bob quantum informa- 
tion in the form of a quantum state. Suppose also that 
they share an entangled two-qubit system, described by 
a rank-2 mixed state, such that the two qubits may be 
viewed as being in one of two orthogonal maximally en- 
tangled states, but it is not known which one of these 
states they are in. Before teleportation may be accom- 
plished, the parties must determine which state actually 
describes their pair of qubits. 

For the above example of entangled qubits, we will see 
that it is in fact not possible to determine which state 
the qubits are in, while at the same time preserving en- 
tanglement. However, for higher-dimensional systems, 
there are conditions under which both tasks may be ac- 
complished simultaneously. This is the question we wish 
to study: When can a set of orthogonal, bipartite states 
be distinguished by LOCC while preserving some part of 
the original entanglement present in those states? In the 
next section, I will argue that Schmidt ranks of the states 
provide a useful characterization of these sets, as well as 
insight into ways of thinking about this problem. Most 
of our results will be stated in these terms. 

Throughout the paper, consideration will be restricted 
to cases where there is only a single copy of the given state 
available to the parties. An LOCC protocol should be 
understood to mean that the parties perform a sequence 
of measurements with each outcome communicated to 
the other party, who may then use that information in 
choosing the next measurement in the sequence. The fi- 
nal outcome is represented by an operator A ^ B, with 
A and B each equal to (ordered) products of Kraus op- 
erators or {Bm^} corresponding to individual 
outcomes {l,rn) in the sequence of measurements (here 
labeled by /i, v). 

In this paper, we will mainly be concerned with deter- 
ministic distinguishing, whereby the parties are always 
able to determine which state they have. Unless explic- 
itly indicated otherwise, the term "distinguishing" should 
be understood in this sense. In this case, the requirement 
of completeness may be imposed; for example, 

(1) 

I 

with Ia the identity operator on Alice's Hilbert space 
Ha- By the statement that the state \'i>j) is identified 
(or distinguished) by outcome A B while preserving 
Schmidt rank rj, we will mean that the Schmidt rank 
of the residual state {A (g) B)\'^j) is rj, and that {A (g) 
S)|*fe) =0, Vfe^,. 

The paper is organized as follows: In the next sec- 
tion, a simple example is given to illustrate the basic 
ideas, and then it is argued that Schmidt ranks will be a 
useful quantity for characterizing distinguishability with 



preservation of entanglement. In Section Hill a brief out- 
line of the types of protocols to be discussed is given. 
Section [TVl presents the main theorems including (1) a 
maximum for the number of states in the case that every 
outcome must preserve a fixed minimum Schmidt rank; 
(2) an upper bound for the sum over Schmidt ranks of the 
initial states when only one-way classical communication 
is allowed, again assuming that every outcome preserves 
a fixed minimum Schmidt rank; and (3) conditions on 
the states such that their original Schmidt ranks can al- 
ways be preserved. Two additional theorems are given in 
Section |Vl explicitly demonstrating a tradeoff between 
the extent to which the set of states fill Hilbert space, 
as measured by their Schmidt ranks, and how refined 
the parties must make their measurements. In all cases, 
the effects of restricting or expanding the resources avail- 
able to the parties (types of local operations and amount 
of classical communication) is discussed. In particular, 
various examples are given where two-way communica- 
tion allows the parties to accomplish what our theorems 
show cannot be accomplished with one-way communi- 
cation alone. One of these examples demonstrates that 
the sum of Schmidt ranks can exceed the dimension of 
Hilbert space, yet the states can nonetheless be deter- 
ministically distinguished. Other examples show that 
when general separable operations are allowed, the par- 
ties can do even better than they can using LOCC with 
two-way communication. Several of the simpler proofs of 
these theorems are included in this section, whereas the 
more lengthy proofs are given in Appendix [X] Then, in 
Section IVIl I point out the close correspondence of the 
present study to the important question of using LOCC 
to obtain a pure entangled state from a single copy of a 
mixed state. Extension to multipartite systems for two 
of the theorems is also discussed. Finally, in Section fVl II 
I present a summary of the results. In one of the appen- 
dices, I include a very simple proof of nonlocality without 
entanglement, using a transparent and intuitively clear 
argument. The final appendix discusses LOCC protocols 
where the parties are not allowed to communicate until 
after they have completed their measurements. 



II. CHARACTERIZATION BY SCHMIDT 
RANKS 

In this section I argue that it will be useful, in char- 
acterizing a set of states to be distinguished, to consider 
the Schmidt ranks, Rj, of the states in that set. Let us 
begin with a very simple example to illustrate the gen- 
eral idea of distinguishing and preserving entanglement 
(additional examples will appear in the following sections 
as illustrations of the theorems). The example involves 
two states on a 4 x 4 system (I omit normalization where 
it is unimportant): 

|*i> = mAB + |13)ab + \20)ab + \il)AB, 

1*2) = mAB + \n)AB + \22)ab + |33)ab. (2) 
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FIG. 1: Representation of the set of states given in Eq. 0. 
Alice's basis states are denoted along the left side of each 
grid; Bob's along the top. The numbers [j) inside the boxes 
indicate the state has the corresponding product state 

as a component, (a) For the bases of Eq. ([2}, it is easily seen 
that the states can be distinguished by LOCC, preserving 
Schmidt rank of 2 for all outcomes, (b) When viewed in other 
bases, the distinguishability may be far from obvious. 

Alice and Bob perform measurements, each with two out- 
comes corresponding to projectors, which with a = A or 
B, are 

P»i = |0)„(0| + |1)„(1|, 

Pc.2 = |2),(2| + |3)„(3|. (3) 

If Alice obtains outcome 2, for example, then 
|$i> =Pa2|*i> = \20)ab + \SI)ab, 

|*2> =Pa2|*2) = |22)^B + |33)Ai3, (4) 

which leaves Bob with reduced density operators (p^ = 

Tryi(|^'j)(\['j|)) proportional to Pbi or Pb2, respectively. 
Then if he obtains outcome 1, and if they communicate 
their results to each other, they will know that the state 
was l^*!), and more importantly, that they now share 
the state Any other pair of outcomes leads to the 

same sort of conclusion: they know which state they had, 
and also know the state that remains, that being uni- 
formly entangled across 2-dimensional subspaces. In this 
example, Bob was able to use the same measurement 
regardless of Alice's outcome, though they still had to 
communicate classically in order to determine the state. 

The representation of the original states in Fig. [^a) 
provides intuition as to what can and cannot be accom- 
plished. It is apparent from this diagram that the two 
states each fill too much of the space, and are too in- 
tertwined with each other, for it to be possible to 
distinguish and preserve Schmidt rank of r = 4 even 
for a single (LOCC) outcome. On the other hand, it 
is quite clear that there is "room" enough for them to 
be distinguished preserving r = 2. Of course, since we 
are dealing with quantum systems, an unlimited number 
of other bases are available to us for representing these 
states. One other choice is shown in Fig. [T]Jb), in which 
the distinguishability of the states is by no means clear, 
let alone the possibility of preserving entanglement in the 
process. In the latter depiction, it appears that each state 



by itself fills the whole space. If we were dealing with 
classical probability distributions, this conclusion would 
be correct and distinguishing the distributions would be 
impossible. For the case we are considering, however, 
the existence of quantum superpositions forces us to re- 
examine what is meant by the notion of "filling space" . 
We need a way to measure how much space a given state 
occupies, and if possible, to what extent the states are 
intertwined with each other. The Schmidt rank of the 
states provides such a description. When a state is writ- 
ten in its Schmidt basis, the part of the space it "occu- 
pies" is minimized, and it is apparent in diagrams such 
as Fig. [l](a) just where that region is. Furthermore, one 
can see in this diagram the level of intertwining amongst 
the states, and this would be true, at least qualitatively, 
even if the bases used were the Schmidt bases for only 
one of the states. This argument should at least make it 
plausible that consideration of Schmidt ranks, Rj, of the 
original states will be an advantageous approach, and this 
is what we shall do in the following sections. We shall also 
find it useful to consider Schmidt ranks, r^, of residual 
states, those remaining after the parties complete their 
measurements. We will see in the various theorems be- 
low, that the ranks Rj and the amount of entanglement 
that can be preserved, as measured in somewhat quali- 
tative terms by rfc, are two closely related quantities. 

The protocol described above succeeds in preserving 
entanglement by partitioning the respective spaces into 
subspaces that are larger than one-dimensional. In con- 
trast, the approach of [2] utilized projections onto pure 
states, which is clearly inadequate for the purpose of pre- 
serving entanglement since it always leaves them with a 
product state. One way of looking at this is that they 
have constricted the states too much, squeezing out all 
of the entanglement. This difficulty can be overcome by 
relaxing one's grip, projecting onto higher-dimensional 
subspaces in making measurements. The tradeoff is that 
more entanglement means less information: the higher 
the dimensionality of the subspaces in the partitions, the 
more entanglement can be preserved, but less informa- 
tion about the state is obtained, making it more difficult 
to distinguish the states. So while it is always possible to 
distinguish a pair of orthogonal states, the added require- 
ment of preserving entanglement leads us to a very rich 
structure with many challenging and interesting prob- 
lems to investigate. We begin such a study in the follow- 
ing sections. 



III. TYPES OF PROTOCOLS 

Our aim in this paper is to characterize sets of bipar- 
tite states, S = which allow the parties to 
distinguish while preserving entanglement. Such a char- 
acterization does not depend solely on properties of S, 
however, but also on the tools that are available to the 
parties as they attempt to accomplish this task. In gen- 
eral, we will restrict the parties to LOCC, so there are 
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two such tools we wish to consider: (1) the types of local 
operations (LO) they are able to implement; and (2) the 
amount of classical communication (CC) they are allowed 
to share with each other. Whenever possible, extension 
to the class of separable operations [jTI will be consid- 
ered, with comments on how this might enlarge the class 
of allowable sets S. 

The two types of local operations to be discussed 
are orthogonal, projective measurements and generalized 
measurements. As illustration, suppose Alice performs 
a measurement represented by the set of Kraus [H op- 
erators Ai, which must obey the completeness relation, 
Eq. For a generalized measurement, this is the only 
constraint, while for an orthogonal, projective measure- 
ment, we also have that AiAi' — AiSw . In the latter case, 
the local Hilbert space is divided into mutually orthogo- 
nal subspaces by the operators Ai , whereas in the general 
case the Ai may divide the space into subspaces that over- 
lap with each other to an arbitrary extent. For classical 
communication, we will assume the parties have access 
to a classical channel that can either carry information 
in only one direction (one-way CC) or in both directions 
(two-way CC). In the former case, one party must mea- 
sure first and then communicate the outcome of their 
measurement to the other party, who must then complete 
the protocol without additional assistance from the first 
party. For two-way CC, they can go back-and- forth mea- 
suring and exchanging information as many times as is 
needed, conditioning subsequent measurements on previ- 
ous outcomes. We will also consider protocols where the 
parties are only allowed to communicate after they have 
completed their measurements. 

Thus, we consider seven types of protocols: 

1. Orthogonal projectors with CC only after measure- 
ments are completed (LOCC-PO) 

2. Generalized (Kraus) operations with CC only after 
measurements are completed (LOCC-KO) 

3. Orthogonal projectors with one-way CC (LOCC- 
PI) 

4. Generalized (Kraus) operations with one-way CC 
(LOCC-Kl) 

5. Orthogonal projectors with two-way CC (LOCC- 
P2) 

6. Generalized (Kraus) operations with two-way CC 
(L0CC-K2) 

7. Separable operations (SEP) 

Since a projector is a Kraus operator (but not vice- versa), 
and since LOCC operations are a proper subset of SEP, 
there is a trend toward more general protocols as one 
moves down the list, as well as toward more sophistica- 
tion in the resources needed to implement them. There- 
fore, conditions on S necessary and/or sufficient for one 
of these types of protocols have implications for other 




FIG. 2: Lattice diagram showing relationships between the 
various types of protocols discussed in the text. A connected 
path upward (downward) from one vertex, possibly passing 
through others, to a second vertex indicates that a sufficient 
(necessary) condition on the first implies the same for the 
second. 

protocol types. Figure [2] illustrates the specific relation- 
ships. We will be interested in characterizing the sets of 
states that allow distinguishing with preservation of en- 
tanglement for each of the protocol types. Ideally, one 
would like to have a complete characterization describing 
precisely which sets of states are allowable in each case. 
Such a lofty goal must await further efforts, but I hope, 
nonetheless, that the results presented below will be of 
some interest to the reader. 



IV. MAIN THEOREMS 

Our main results are presented in this section, in the 
form of several theorems and corollaries aimed at charac- 
terizing sets of states which allow two parties to distin- 
guish while preserving entanglement using LOCC. These 
results are discussed with particular attention to how 
these characterizations may be effected by a change in the 
specific type of LOCC (or SEP) protocol used. For exam- 
ple, if a theorem provides a bound on a certain quantity, 
such as the number of states that can be included in the 
set, that bound, as well as whether or not it is a tight 
bound, may depend on the protocol. Whenever possi- 
ble, we provide comments on such questions. To aid the 
flow of the discussion, the longer proofs are not presented 
here, but are given in an appendix. 



A. Maximum number of states 

If the parties need not preserve entanglement, there is 
an obvious upper bound, N < DaDb, on the number 
of states that can be distinguished by LOCC (or other- 
wise) if the space has dimension DaDb- The following 
theorem generalizes this result to the case where entan- 
glement must be preserved, providing a relationship be- 
tween the filling of Hilbert space by the initial states and 
the entanglement (Schmidt ranks r^) that can be pre- 
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FIG. 3: A set of states achieving the bound of Theorem [T] 
using LOCC-PO. Each numbered box represents an r x r 
subspace, and A'max = 25 with [DA/r\ — 5 — [-Ds/rJ in this 
example. 



served. 

Theorem 1 Suppose the parties share a Da^ Db system 
and using L0CC-K2 are able to distinguish with certainty 
amongst a set of N states while preserving Schmidt rank 
of at least r for every outcome. Then, 



N < [DA/r\ [DB/r\ = N„ 



(5) 



where [x\ is the largest integer not greater than x, and 
this upper bound is achievable by LOCC-PO. 

Thus, for each of the types of LOCC, the number of 
states can be as large as iVmax and no larger. The idea of 
the proof, presented in Appendix lA li is to sequentially 
introduce divisions of Hilbert space into orthogonal sub- 
spaces in a way consistent with LOCC and such that 
after the final division, no subspace has dimension larger 
than r on either party's side. The maximum number of 
states on the whole space is then bounded above by the 
sum over maximum numbers of states on the subspaces, 
which for this particular method of division yields the 
upper bound given in the theorem. This bound is tight 
for all (LOCC) protocol types, as there exist sets of A^max 
states that can be distinguished by LOCC-PO. Such a set 
of states is depicted in Fig. [3l where each of the numbered 
square blocks represents an r x r subspace. 

If separable measurements are used, it is possible to 
have N > A^max- A specific example [8] of such a set in 
3x3 with r = 2 is 

|vl/i) = |00) + |22), 
|vI/2) = |01) + |12), 

1*3) - |10) + |21). (6) 

The separable POVM = At„„A™„(K)Btj„S™„, with 

Aran, Bmn the Corresponding Kraus operators) which dis- 
tinguishes this set is 
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FIG. 4: The three states of Eq.[6l which can be distinguished 
by the SEP POVM of Eq. [7] preserving Schmidt rank of r = 2 
for all outcomes, a task that cannot be accomplished by any 
LOCC protocol. 



with a = (2 - n/3)/4, /3 = 2 + V3, and E,„,„ = 
Ia® Ib- E,nn identifies state I'I'm) and preserves r = 
2 in all cases. Although Mnax = [3/2j[3/2j = 1, we 
here have three states in a set that is deterministically 
distinguished by SEP preserving r ~ 2. These states are 
depicted in Fig. HI It should be at least plausible from 
this diagram that no LOCC protocol can succeed at this 
task, and that this remains true even if one of the three 
states is removed, a view confirmed by the theorem. 



B. Schmidt rank sum for one-way protocols 

1. An upper bound for one-way CC 

We now consider the sum over Schmidt ranks {Rj) of 
a set of states (|^'j)) which is perfectly distinguishable 
by LOCC on Da ^ Db- There exist a number of in- 
teresting results in the literature d 0, [n [n, in that 

together suggest the following intuitively pleasing upper 
bound on this sum: Rj < DaDb (there is no consid- 
eration of preserving entanglement in these papers) . For 
example, (1) no more than D maximally entangled states 
on a D X D system (^ i?, < D^) can be perfectly dis- 
tinguished [1j1 (see also [10, UH), and (2) if a complete 
basis is p erfectly distinguishable, it must be a product 
basis [141. In these papers, the parties are allowed to use 
two-way communication. The following theorem gener- 
alizes this upper bound to the case where entanglement 
must be preserved, but is proved only for a restriction to 
one-way classical communication. In fact, I will show by 
means of counter-examples in the next subsection that 
the bound in this theorem can be exceeded when two- 
way communication is allowed, and this includes the case 
that entanglement need not be preserved (r = 1 in the 
theorem). 

Theorem 2 // Alice goes first using LOCC-Kl and 
the parties are always able to distinguish and preserve 
Schmidt rank at least r, then ^ ■ Rj < Da [i^s/rj . 



The proof is given in Appendix lA 21 Another rather ob- 
vious upper bound is -Dmin-^max, where A^max is given 
1|), (7) in Theorem [1] and Dmin is the smaller of Da and Db- 
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Clearly, the bound in the theorem cannot be achieved if 
it is larger than i'min-^max- Assuming both dimensions 
are at least r (otherwise A'max — 0), this will only be 
the case when Dmin — Db < Da/IDa/tI < 2r so that 
-/Vmax = [DA/r\. In this case, DB[DA/r\ is a tight up- 
per bound, realized by a set of \_DA/r\ rank-Ds states 
placed into orthogonal Db y- Db subspaces. Then, the 
parties can always preserve Db > r using LOCC-PO (Bob 
need not measure at all). For all other cases, the bound 
in the theorem is tight (for LOCC-Kl), as is also shown 
in Appendix I A 21 

There are many cases where the bound of Theorem [2] 
can be reached when the parties can only use LOCC-PO. 
For example, if Da is divisible by r, it is easy to construct 
a set of states that will do this, such as can be visual- 
ized by deleting the small rectangles along the bottom of 
Figure [3l Nonetheless, there are also cases where LOCC- 
Pl is not sufficient for the parties to succeed unless the 
Schmidt rank sum is strictly less than DA\_DB/r\, and 
we do not have a tight bound that applies in general for 
this type of protocol. An example is Da = 2r + l = Db, 
where the bound in the theorem is Ar + 2. Suppose the 
set contains the maximum of 4 states (other cases may be 
analyzed in a similar way). The best Alice can do with 
orthogonal projectors is to divide her space into two sub- 
spaces, one of dimension r and the second of dimension 
r + 1, and the same goes for Bob after he is informed 
of her outcome. When Alice obtains the r-dimensional 
outcome, the two states left (if Bob can then distinguish 
preserving r) must be Schmidt rank Rj = r. If one of 
them has rank greater than this, part of that state will lie 
in Alice's other subspace, meaning that when she obtains 
her (r + l)-dimcnsional outcome, they will not be able to 
distinguish the other pair of states from this one. On the 
other hand, when Alice gets her larger outcome, one of 
the second pair of states could have had rank Rj = r+l, 
but the other must have had rank r or else it will not 
now be distinguishable from the first (to see this apply 
the theorem, noting that now Bob is going first). Hence, 
three of the states must have started with rank-r, the 
fourth with rank r + l, and the sum of these ranks is 
Ar + 1 < Ar + 2: the bound cannot be achieved. 
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FIG. 5: The states of Eq. (O, having J2 = 12, exceeding 
the (one-way) bound of Theorem [2l Rj < Da^Db /r\ — 
10 with r = 2. See text for detailed two-way protocol. 



measurement operators. 



= \0)a{0\ + \1)a(1\ + \2)a{2\ + ^\3)a{31 



A2 = i=|3)A(3| + |4)^(4|. 



(9) 



If she gets outcome Ai, Bob designs his measurement as 
Bi = |0)b(( 



B2 = \2)b{2 
1 



1 



V2 



^|3)s(3|, 
|3)b(3| + |4)b(4|. 



(10) 



after which Alice can then distinguish and preserve r = 2 
in all cases. If Alice gets outcome A2, then Bob can 
easily distinguish the remaining states and again preserve 
r = 2. 

Can this bound be exceeded when r — 1 and the par- 
ties use two-way communication? As stated above, sev- 
eral results seem to sug gest that the answer may well be 
negative Ell, fla Il3l|. However, I now give a set 

of distinguishable states on 3 x 3 for which the sum of 
Schmidt ranks is ^ i?j = 10 > 9 = DaDb- The states 
are, 



2. Doing better with two-way CC 

We will now see that the upper bound in Theorem [2] 
does not apply if two-way communication is allowed. An 
example is given by the four states on a 5 x 5 system 
represented in Figure [51 with 



= ^{mAB + W)AB) + mA + \l)A)\l)B, 

1*2) - -^(|00)^b + |$o2)ab)-(|0)a + |2)a)|1)b, 



1*3) = |10)ab, 
1*4) = |20)ab, 



1*5) = 1*12) AS, 
1*6) = 1*22) AS, 



(11) 



1*1) = |00)as + |11)ab, 

1*2) = |02)as + |13)ab + |24)as, 

1*3) = |20)as + |31)ab + |42)as, 

1*4) - |04)as + |22)ab + |33)ab 



|40) 



AB- 



(8) 



The sum of Schmidt ranks is now 12 > DaYDb/t] = 
10, with r = 2. Alice starts with the following pair of 



with 



l<fo)A = i(|0)A + 2|l)A + 2|2)A), 

i(2|0)A + |l)A-2|2)A), 

i(2|0)A-2|l)A + |2)A), 



|*o)a 
|*i)a 

|*2)a 



(12) 
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forming an orthonormal basis. Bob starts with the fol- 
lowing pair of measurement operators, 

Bi = \0)b{0\ + ^\1)b{11 

B2 = |2)b(2| + -^|1)b(1|. (13) 

If Bob obtains Bi, j^'s) and |5'g) are excluded, j^'a) and 
1 4*4) are unchanged, and (apart from unimportant nor- 
malization) 

l^-i) ^ |00)^b + (|0)^ + |1)^)|1)b, 

14-2) ^ |00)ab-(|0)^ + |2)^)|1)b. (14) 

Alice follows with an orthogonal projective measurement 
onto the standard basis in her space. For each of her 
outcomes, only two states remain and are still orthogonal, 
and Bob can then distinguish which one they have. 

When Bob obtains B2 , it turns out that the basic struc- 
ture of the remaining states is exactly the same as for Bi , 
the only difference being that Alice must now measure in 
the basis of the orthogonal states, |<i>A:)A- This is easily 
seen by recognizing that 

|0)a + |1>a = |$o)a + |*i)a, 

|0)a + |2)a = |$o>A + |$2>A, (15) 

In fact, the structure of the original states was also the 
same, considered from the point of view of the |$fc)A ba- 
sis as compared to Alice's standard basis. Hence, the 
parties can also distinguish the states with certainty for 
B2, and the bound — B>aDb has been exceeded. 

The generalization of this construction to higher dimen- 
sions will be discussed elsewhere [l^ . 

For a 3 X 3 system with r > 2, A^max = 1 so the 
Schmidt rank sum cannot exceed 3 = Da ^Db /r\ even 
if L0CC-K2 is employed. Separable operations, on the 
other hand, allow this sum to be at least equal to 6, as has 
already been demonstrated by the example of Eq. It 
would be useful to have a (non-trivial) upper bound on 
the Schmidt rank sum for general LOCC and for SEP, 
but we are unable to provide one here. 



C. Preserving the original Schmidt ranks 

Given a set of states to be distinguished, perhaps the 
most difficult task, and the ideal outcome, would be 
to distinguish while preserving the original state intact. 
Failing this, it might nonetheless be possible to preserve 
the original Schmidt ranks. Here, we consider this prob- 
lem and give, separately, a sufficient and then a necessary 
condition such a set must satisfy. 

There is a sufficient condition which is almost trivially 
obvious: if all the reduced density operators are orthog- 
onal on one side or the other then only one party need 
measure, and they can distinguish preserving Rj using 



LOCC-PO. A less trivial sufRcient condition is given be- 
low as Theorem [31 in which I use the notion of a "cas- 
cading sequence of partitions" , defined as follows: start- 
ing with an arbitrary set of states and considering their 
reduced density operators {pf}, partition these into dis- 
joint subsets such that each is orthogonal to all those 
corresponding to states in different subsets; then par- 
tition each of these subsets into smaller subsets in the 
same way except by considering {pf}', and so on back 
and forth for as many steps as is possible. We will call 
this partitioning "complete" if each final subset consists 
of a single member. 

Theorem 3 The set of states is perfectly distin- 

guishable by LOCC while preserving Rj provided these 
states can be completely partitioned by a cascading se- 
quence, as defined above. Indeed, under these condi- 
tions, the state may be preserved unchanged by L0CC-P2 
(LOCC-Pl if there are only two levels to the sequence, 
one for Alice and one for Bob). 

The proof of this theorem is quite simple. The parties 
need just perform orthogonal measurements projecting 
onto the union of the supports of the appropriate density 
operators in each subset: the first measurement is cho- 
sen to correspond to the first level of the partitioning se- 
quence; the second measurement is chosen to correspond 
to the subsets descending directly from that subset iden- 
tified by the outcome of the first measurement, etc. I 

Given the reduced density operators of the states on 
both sides, the condition may be checked in a fairly 
straightforward way. For the first level of partition, start 
by placing p^ in a first subset Si , and check to see if P2 
is orthogonal to it. If not, also include the latter in Si\ 
and otherwise put it into Now check pf: if it is or- 
thogonal to both pi and p^, include it in a new subset; 
otherwise, include it with the one it is not orthogonal to 
— if it is orthogonal to neither, then they must all be 
included in the same subset even if and are or- 
thogonal to each other. Continue in this way until all 
states are partitioned into subsets. For subsequent levels 
of partition, start with each subset appearing on the pre- 
vious level and partition that subset as described above 
for the first level. If the previous level was partitioned 
according to Alice's density operators, then for the next 
one use Bob's, and vice- versa. If this process can be con- 
tinued until all subsets contain only a single state, then 
the states can be distinguished without being altered. If 
not, then one should check again, this time starting with 
Bob's side instead of Alice's. While not exactly simple, 
it is nonetheless a relatively straightforward procedure, 
which could be readily coded as an algorithm for numer- 
ical implementation. 

This procedure is illustrated by the following set of 
states, represented in Fig. [51 We have 

1*1) = |00)as + |11)ab, 

1*2) = |02>AB + |13>AB, 

1*3) = |2)a(|0)b + |2)b) + |3)a(|1>b + |3>b), 
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FIG. 6: Illustration of the procedure of cascading partitions 
for testing Theorem O described in the text (the states are 
given in Eq. (US}). Bob can separate out {pf } since it is 
orthogonal to all the others, after which Alice can divide the 
remaining four into {pi*,P2'} and {p3^,p4^}. Then Bob can 
complete the partitioning. 



|0)s jl>B |2)i 



|0>A 
!1>A 
|2>A 



FIG. 7: Demonstration that Theorem [3] does not provide 
a necessary condition. These product states can be distin- 
guished (so Rj = 1 is preserved in all cases), but they cannot 
be completely partitioned by a cascading sequence. In fact, 
they cannot be partitioned even once into two non-empty sub- 
sets where the reduced density operators in one subset are 
orthogonal to all those in the other. 



1*4) = |2)^(|0)b-|2)b) 
|*5> = \U)ab + \25)ab- 



|3)a(|1>b-|3), 



(16) 



Notice first that Alice cannot start the procedure, since 
her density operators do not partition into two non- 
empty subsets such that all those in one subset are or- 
thogonal to all those in the other. On the other hand, 
Bob can separate out {pf} since it is orthogonal to all the 
others, after which Alice can divide the remaining four 
into {Pi,P2} and {pf^pf}- Then Bob can complete the 
partitioning (his corresponding measurement will depend 
on the outcome of Alice's preceding one). 

This condition is not a necessary one. It is not satisfied 
by the distinguishable set of product states (see Fig. [7]), 

1*1) = (|0)^ + |1)a)|0)s, 
|*2> - \2)a{\0)b + \1)b), 
1*3) = |0)^(|1)b + |2)b), 

1*4) = i\l)A + \2)A)\2)B. (17) 
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FIG. 



(a) (b) 
8; Intuitive picture indicating how Theorem [3] can be 



proved. The parts of this figure correspond to the two ways it 
can happen that P2pi 7^ (only selected components of [^'2) 
are shown). In either case, any individual measurements the 
parties can perform that preserve Rj leave the picture essen- 
tially unchanged, which means they have not distinguished. 
See Appendix I A 31 for a detailed proof. 



Looking at Bob's reduced density operators, for exam- 
ple: the first is not orthogonal to the second, the second 
not to the third, etc. Since the same argument holds on 
Alice's side, these states cannot be partitioned even once 
into two non-empty subsets where the reduced density 
operators in one subset are orthogonal to all those in the 
other. Nonetheless, they can readily be distinguished by 
LOCC-PO using projective measurements in the standard 
basis on both sides. 

A necessary condition is given in the next theorem, 
stated in terms of a set of density operators defined as, 



pf®pf. 



(18) 



Theorem 4 If a set of states (l^'j)} is perfectly distin- 
guishable by LOCC while preserving Rj, then the density 
operators {pj} form a mutually orthogonal set. 



The proof of this theorem is presented in Appendix lA 31 
The idea behind the proof can be seen from the follow- 
ing discussion. If P2P1 7^ 0, there are two possible ways 
this may come about, as indicated in parts (a) and (b) 
of Fig. [H The idea is that the two states are too closely 
intertwined in each case for them to be separated with- 
out significant distortion (that is, without a decrease in 
Schmidt rank). The first possibility is shown in part (a) 
of the figure, in which a component of [^'2) hes within 
the i?i X i?i box representing the region of Hilbert space 
that is fully (at least according to the reduced density 
operators) occupied by As is shown for this case 

in Appendix lA 3i neither party can "remove" {^2) from 
the Y^i) box by any complete LOCC measurement with- 
out reducing the Schmidt rank of one or the other of 
the states. Therefore, the picture shown in the figure 
persists throughout their protocol, no matter how many 
rounds of measurements they make. This means they can 
never eliminate \^2) while preserving and must fail 
to distinguish. The second case, illustrated in Fig. [Sl^b), 
is argued in essentially the same way. Note that it may 
be possible for the parties to implement individual mea- 
surement operators that separate the states and preserve 
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Rj, but it is not possible for them to do so for every 
outcome of a complete measurement. 

Every set of orthogonal product states satisfies the 
conditions of this theorem, but it is well known not ev- 
ery such set can be distinguished by LOCC, demonstrat- 
ing that the condition of the theorem is not a sufficient 
one. The best known example of such a set of product 
states was provided by Bennett and co-workers [15 ] in 
their discussion of "nonlocality without entanglement" . 
Other proofs of this phenomenon, simplifying that of the 
original paper, have appeared in the literature [T^. 
However, it does not appear to this author that any of 
these proofs is particularly transparent or intuitive. In 
Appendix [B] I supply such a proof, where it is shown 
in a very simple and direct way that the parties cannot 
perform any local operation other than a unitary with- 
out destroying the orthogonality of the states. Since a 
unitary operation cannot yield any information, nor can 
it eliminate even one of the states, the parties cannot 
distinguish this set of states. 

The following example also demonstrates the condition 
of this theorem is not sufficient, this time with entangled 
states. The pj are orthogonal, but they cannot be dis- 
tinguished since there is no measurement either party 
can make that is less than full rank without reducing the 
Schmidt rank of at least one of the states. The states are 

= \01)ab + \12)ab. 

1*2) = \n)AB + \2A)AB, 

|*3> - |20)ab + |31)ab, 

1*4) = |32)^s + |43)ab. (19) 

For example, if I'^i) is not eliminated for outcome Ai, 
then the support of Ai must include a two-dimensional 
subspace that is not orthogonal to either \0)a or |1)a- 
Then {'^2) is not eliminated so the support of Ai cannot 
be orthogonal to \2)a, etc. On the other hand, if is 
eliminated, then the kernel of Ai must include \0)a and 
|1)^, which means that |5'2) must also be eliminated so 
the kernel of Ai must include \2)a as well, etc. Thus, 
since we may assume Ai ^ 0, the rank of Ai must be Da- 
It is true that the structure of these states is altered by 
this operation — for example, Ifc)^ Wk)^ — but while 
the laj.)^ need not be orthogonal, they do need to be 
linearly independent. Then, an argument similar to the 
above will again show that subsequent measurements by 
the two parties must all be full rank. This means they 
can never eliminate even a single state, so this set cannot 
be distinguished without reducing at least one of them 
to a product state. 

The condition of the theorem is not necessary for SEP. 
The states of Eq. ([5]) provide a counter-example, since 
as already shown they are distinguishable by SEP while 
preserving the original Schmidt ranks, but the density 
operators pj, corresponding to these states, are not mu- 
tually orthogonal. It is conceivable, on the other hand, 
that the condition of the theorem is sufficient for SEP, 
but we do not know if this is the case. Given the pj are 



mutually orthogonal, one might try constructing a sep- 
arable measurement starting with orthogonal projectors 
Aji^Bj, one for each state such that the support of 
Aj (Bj) is equal to that of p^ (pf)- However, if the set 
of states is an unextendible product basis [3, [l^ (each 
such set satisfies the conditions of the theorem), then the 
projector onto the remaining part of Hilbert space is pro- 
portional to a bound entangled state, meaning that no 
separable operation exists to complete this measurement. 
The starting point of this argument is a very special set 
of operations, so it does not constitute a proof the states 
arc indistinguishable by SEP. In fact, it has been proven 
that every unextendible product basis in 3 x 3 is distin- 
guishable by SEP [l^ , so sufficiency for SEP remains an 
open question. 

Since the rank of pj is i?| and we know from the pre- 
vious theorem that deterministic distinguishing while al- 
ways preserving Rj requires the set of these density op- 
erators to be mutually orthogonal, we have 

Corollary 5 If a set of states can be perfectly distin- 
guished by LOCC while always preserving Rj, then 

N 

J2R^j<DaDb. (20) 

Once again, the set of states in Eq. ([6]) provides an 
example showing that this corollary does not hold for 
SEP. For these states, DaDb = 9 whereas J2j = 12- 



V. ADDITIONAL THEOREMS 

We now give two additional theorems, which relate 
Schmidt ranks of the states in the original set to be dis- 
tinguished with those of the residual states. In particular, 
I consider how the largest Schmidt rank rj that can be 
preserved for state jvl/j) is constrained by the collection 
of original Schmidt ranks {Rj'} and the dimensions Da, 
Db of the Hilbert spaces TLa, "Hb- It will be convenient 
to write the original states as 

D 

nm\m)A\n)B: (21) 

m,n— 1 

with Mj a matrix of rank Rj . Then for a given measure- 
ment outcome, A (g) i?, the parties will be left with 

D 

nni\m)A\n)B, (22) 

m.n—1 

where is the transpose of the matrix A. In the fol- 
lowing, we will use two facts: 

1. the Schmidt rank, r^-, of the residual state is given 
by the rank of the matrix r{BMjA^); and 



2. if l^fj) is identified deterministically (or unambigu- 
ously [13]) by outcome A(g) B, then BMkA^ = 
Vfc^j. 

We will also find useful in this section two inequalities 
on matrix ranks , which say that for m x I matrix X 
and I X n matrix Y, the rank r[XY) of their product is 
bounded as 

min[r(X), r(r)] > r{XY) > r{X) + r{Y) - I. (23) 

The first theorem we will consider concerns general, two- 
way protocols and applies in both the deterministic and 
unambiguous cases. 

Theorem 6 Given the task of deterministically or un- 
ambiguously distinguishing a set of bipartite states, 
having Schmidt ranks {Rj}, then for every separa- 
ble outcome Am Bm distinguishing and preserving 

'j ' 

2r-™ + max(i?fe) <Da + Db. (24) 

Proof: For either the deterministic or unambiguous case, 
we have that r™ = r{BmMjA^) implying r{B„i) > rj^ 
and r{A^) > rf, and riB^MtAl^) = Vfc^j. From the 
latter expression with Eq. (I^S)) . we have 

> r{Bm)-^r{MkA'i)- Db 

> r{Bm) + r{Am) + r{Mk) -Da-Db 

> 2rf +Rk- Da- Db. (25) 

and the theorem easily follows. ■ 
Note how this expression explicitly shows the tradeoff 
between the original and final Schmidt ranks, in rela- 
tionship to the Hilbert space dimensions. The following 
corollary offers one example of how this result can be 
useful. 

Corollary 7 // any of the original states, say the first, 
has Schmidt rank Ri = Da (Da < Db), then one can- 
not preserve Schmidt rank exceeding -Db/2 for any single 
SEP outcome identifying \^ j) with certainty when j =/= 1- 
If any two states start out with Schmidt ranks equal to 
Da, then no outcome can preserve greater than Db/2. 

When Db/^ < Da, these statements are non-trivial and 
are a consequence of the extent to which the rank-Z?^ 
states are spread through the space, so they cannot be 
annihilated by measurement operators of rank exceeding 
Db/2. In general, as the largest Schmidt rank i?max de- 
creases, less of the space is occupied by the corresponding 
state, which can then be annihilated by higher-rank op- 
erators, allowing larger Schmidt rank to be preserved for 
other states. From another point of view, the amount of 
information required to distinguish decreases along with 
Rmax, so the parties may use less refined measurements 
allowing to be greater. 

According to the proof of this theorem, if any single 
state I^'k) is excluded by the outcome m, even if there are 
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FIG. 9: Illustration of Theorem H (a) For Da = 2 and 
-Ds = 3, it is not possible to preserve entanglement while 
distinguishing if more than one state is initially entangled, (b) 
Increasing Db allows higher- rank states to be fully separated 
from each other, and then preserving entanglement becomes 
possible. As seen in the right half of this diagram, however, 
the presence of additional states can alter this conclusion, 
(c) For a 3 X 3 system with Ri — 3, |*2) cannot remain 
entangled after being identified by the measurements, (d) 
Again, increasing the size of 7is allows more space for the 
states and even with i?i = 3 it is possible to preserve r2 — 2. 



non-zero probabilities for identifying several other states, 
the bound in the theorem still holds with maxfc^j(i?fc) 
replaced by i?^. Then the result becomes applicable to 
protocols that allow for errors in identifying the state. 

We see from this theorem that when Da = Db = 2, 
it is not possible to distinguish by SEP while preserv- 
ing entanglement, even for a single outcome (this con- 
clusion holds for all protocols that exclude at least one 
state for every final outcome, so is not restricted to de- 
terministic, or even unambiguous, distinguishing). Other 
simple examples giving an intuitive picture for this the- 
orem are presented in Fig. [9l For the case Da = 2 and 
Db ~ 3 {Db/'2 < Da), preserving entanglement while 
distinguishing requires that no more than one state is 
initially entangled. In part (a) of the figure, we see 
that with both states rank-2, they are necessarily too 
intertwined for entanglement to be preserved. Increasing 
Db allows higher-rank states to be fully separated from 
each other, and preserving entanglement becomes pos- 
sible (Fig. m^b)). With a 3 X 3 system no more than 
one of the states can have Rj = 3, and if one does 
have this rank, the other states cannot remain entan- 
gled after being identified by the measurements. This is 
seen in Fig. [9ljc) where can be distinguished in the 
|0)a|0)b -I- |1)a|1)b corner of the box, preserving rank-2, 
but no other outcomes can distinguish while preserving 
entanglement. Again, increasing the size of Hb allows 
more space for the states and even with i?i = 3 it is 
possible to preserve r2 — 2 (Fig. [DJd)). Notice that in 
Fig. [DJb) and (d), Schmidt rank of Db/2 can be pre- 
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served. In Fig. ^c) we see that -D_b/2 can be exceeded 
for ri , but this is in Une with the theorem, since the rank 
of the other state is i?2 < Da- It is easily seen in the 
latter case that any attempt to increase i?2 to Da would 
destroy the abihty to preserve ri > Db/^. 

When Da and Db do not differ by too much, the the- 
orem gives a non-trivial bound, but since Schmidt ranks 
cannot exceed the smaller dimension, this is no longer 
the case for Db/2> Da (or with A and B reversed). In 
addition, as is illustrated in the right half of Fig.lU^b), the 
presence of additional states can alter conclusions about 
the amount of entanglement it is possible to preserve. 
This demonstrates that the theorem gives only a neces- 
sary, and not a sufhcient, condition for preservation of 
entanglement. 

We now consider a restriction to one-way classical com- 
munication. The next theorem again shows there is a 
tradeoff between the starting and residual Schmidt ranks, 
though here the tradeoff involves both the number of 
states, N, and the average Schmidt rank, R = J^j Rj/^ 

Theorem 8 With Alice going first in a one-way LOCC 
protocol, if for any one oj Alice's outcomes (Am) Bob is 
able to deterministically distinguish the remaining states, 
then 



R<Da + Db/N, 



(26) 



where rj^ refers here to the Schmidt rank of fol- 
lowing Alice 's outcome, and both before and after Bob 
measures (see below). 

Proof: In order for Bob to be able to distinguish with cer- 
tainty after Alice obtains outcome m, the reduced density 
operators of the various possible states remaining must 
be mutually orthogonal (implying that they can preserve 
r™). The rank of each of these reduced density opera- 
tors is r{AIjA'^) (Bob has yet to do anything so I have 
set B = Ib), and their orthogonality implies that the 
sum of these ranks cannot exceed Db- Then, again us- 
ing Eq. (|23p. we have 

N 



Db > ^r(M,A^J 



N 



> J2^Ri+r{Am)- Da) 



N 



= Y,Rj+N{r{Am)-DA)- 



(27) 



With r™ < r{Am)i the theorem follows immediately. ■ 
The following upper bound on the number of states 
will follow as a direct consequence of this theorem: 

Corollary 9 With Alice going first followed by Bob de- 
terministically distinguishing, 

Dr 

N < = , (28) 

T-max + R - Da 
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FIG. 10: Db rank-Dx states that are distinguishable by one- 
way LOCC. 



with 



the largest value of rj' 



Of course, this bound should only be applied if r^ax + 
R > Da; otherwise Eq. is trivially satisfied without 
regard to the value of N. 

If Da < Db and all the states have their maximum 
rank of Rj — Da, we see that the parties cannot pre- 
serve Schmidt rank greater than Db/N, or alternatively 
the number of states cannot exceed DB/ry^nx- If they 
are not concerned with preserving entanglement, then 
setting Tniax = 1 shows that if a set of Schmidt rank Da 
states can be distinguished perfectly by one-way LOCC, 
it cannot have more than Db members. This generalizes 
(at least when restricted to one-way communication) the 
results of [l3, [HI that no more than D maximally en- 
tangled states on D X D can be perfectly distinguished. 
Fig. [To] makes clear that Db rank-DA states can be dis- 
tinguished. These states fill the space, in the sense that 
the sum of Schmidt ranks is equal to the Hilbert space di- 
mension. This diagram makes it seem almost intuitively 
obvious that adding another rank-DA state would make 
it impossible to distinguish (a conclusion which is correct, 
though when dealing with quantum systems, we should 
always be careful about trusting such intuitions). 

In Appendix O it is shown that these bounds for one- 
way CC (Theorem [5] and Corollary ^ can be exceeded 
if two-way CC is allowed. Included in this appendix are 
examples where, depending on the outcome of Bob's mea- 
surement, Alice risks (1) by measuring, the destruction 
of entanglement that would otherwise be preserved; as 
opposed to (2) being unable to distinguish the states if 
she does not measure. Thus, we have interesting and 
non-trivial cases where the main purpose of the classical 
communication is simply to determine whether the next 
party should proceed with any measurment at all. 

In Appendix ini two additional theorems are given, re- 
lated to protocols of type LOCC-KO, where the parties 
are not allowed to communicate until after they com- 
plete their measurements. These theorems address the 
question of always distinguishing with a set of TV rank-Z? 
states on D X D, in which case Theorem |S] tells us that 
the maximum possible residual Schmidt rank is \_D/N\. 
One of these theorem shows that when this maximum 
rank is an integer, and the parties can preserve this rank 
for any single outcome, then they do so for all their out- 
comes using LOCC-PO. The other theorem deals with 
the case of non-integer [D /N\ . 
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VI. DISCUSSION 

A. Relationship to non-collective entanglement 
purification 

When entanglement is shared between two parties un- 
der realistic circumstances, it is very difScult to com- 
pletely eliminate the effects of noise, which may enter in 
the creation of the entangled state or when it is shared 
between the parties through a quantum channel. As a 
result, the parties commonly share a mixed state rather 
than a pure one. Pure state entanglement is, however, 
necessary for many implementations of quantum infor- 
mation processing so it is important to understand when 
the parties will be able to purify their shared state. 

Non-collective entanglement purification [l^, [H, [2^ is 
the process of obtaining a pure entangled state from a 
single copy of a mixed state. The question we are con- 
sidering in this paper is directly related to this process: 
Alice and Bob are given a state drawn from a set 
of N mutually orthogonal, bipartite states with some a 
priori probabilities pj , but are not told which state was 
chosen. They may then describe their system by the 
mixed state, 

N 

/3 = Epj-|*^)(*J-|- (29) 
i=i 

Together, with some probability, they perform an opera- 
tion r, obtaining the new state 

N 

where |$,) = r|v|/,)/vWPlW and q, = 

p,(vi/,|rtr|*,)/Ef.iP,(*,|rtr|*,-).^ 

We want to know if and when p' is a pure entangled 
state. It will certainly be pure if qj — for all j except 
one; that is, if Fj^'j) ^ Sjj, for some fixed J. Whether 
or not it is entangled will depend on the relationship 
between the operator F and the state If it is entan- 

gled, then the parties have identified the original state as 
jvPj) while preserving entanglement, which is the subject 
of the work described in this paper. 

Can p' be pure and entangled when more than one 
of the Qj are nonzero? The answer is yes if and only if 
the nonzero r|\['j) are all the same, up to normalization 
and phase. In other words, entanglement purification is 
possible without distinguishing amongst the eigenstates 
of p, but only if there exists a product operator F sat- 
isfying the above-stated condition. This is equivalent to 
the statement that there must exist a product projec- 
tor such that all the original states not annihilated by 
it were "equivalent" on the support of that projector. 
When such a projector does not exist, then the only pos- 
sibility for non-collective entanglement purification is by 
the methods discussed in this paper. 



Previous discussions of entanglement purification (23|, 
[13, nil have focused on the case where the final state 
is uniformly entangled (all Schmidt coefficients equal 
to each other), corresponding to a maximally entangled 
state on a smaller space. The results presented in this 
paper are concerned instead only with the Schmidt rank 
of the residual state. However, given any pure entan- 
gled state, one can with nonzero probability obtain a 
uniformly entangled state by local operations on the sep- 
arate parts [2^, [23] ■ Therefore the questions addressed in 
this paper, concerning use of LOCC to distinguish a set of 
states and preserve entanglement, are directly related to 
previous discussions of non-collective entanglement pu- 
rification. 



B. Multipartite systems 

Any multipartite system may be viewed as bipartite 
by choosing a division of the parties into two groups. We 
may, for example, let Alice be in one group by herself and 
the remaining parties in the other. A question of interest 
is whether there is a way to extend our various argu- 
ments to also apply when the many parties are viewed 
separately. In order for this to be possible, we must first 
find a suitable generalization of Schmidt rank. Possible 
generalizations have been proposed [H, [2^, but here I 
will only consider a rather simple one: the (generalized) 
Schmidt rank, Rj , of a multipartite quantum state is the 
smallest rank of the completely reduced density operators 

pf, Pf, Pf, •••• 

The proof of Theorem [1] involves each party locally 
and sequentially dividing the composite Hilbert space 
Ha^'Hb into orthogonal subspaces. This division follows 
directly from the fact that their measurement operators 
include ones with non-empty kernel, which is orthogonal 
to that operator's support. Such divisions are not re- 
stricted to bipartite systems, and it is also true for the 
multipartite case that these divisions can be continued 
until all subspaces are no larger than r x r x r x • • •. If 
they must preserve (generalized) Schmidt rank of at least 
r for each of their outcomes then after optimally choosing 
their operations, no part of any state can reside in any of 
the subspaces smaller than this, one state may be placed 
entirely within each of those that are r x r x r x • • •, and 
we see that 

Nr,^a. = L^a/H [i^s/H lDc/r\ ■ ■ ■ (31) 

for any multipartite system, when the parties must pre- 
serve at least r. 

The proof of Theorem [3] for the multipartite case is 
also essentially the same as for bipartite systems, and 
the statement of the theorem applies without alteration. 
That is, if the multipartite states can be completely par- 
titioned by a cascading sequence, then they can be per- 
fectly distinguished while preserving the original states 
intact. The partitioning of the states into subsets again 
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points to a protocol the parties may use, involving or- 
thogonal projections onto combined supports of reduced 
density operators in the appropriate subsets. 



corresponding separable POVM given in Eqs. ([6]) and 
([7]). This work was partially supported by the National 
Science Foundation through Grant PHY-0456951. 



VII. SUMMARY 

In summary, I have introduced the question of pre- 
serving entanglement in the course of locally distinguish- 
ing an unknown state drawn from a set of orthogonal 
states. Several results on this topic have been proved. 
Theorem [T] (generalized by Eq. (PTjl ) gives the achiev- 
able maximum number of states on a multipartite system 
when a (generalized) Schmidt rank of r must always be 
preserved. Theorem [5] showed that for bipartite systems 
and one-way classical communication from Alice to Bob, 
the sum of Schmidt ranks of the states cannot exceed 
Da \_D b It\ , when once again the parties must always 
preserve Schmidt rank of r. The next two theorems con- 
sidered the possibility of preserving the original Schmidt 
ranks of the states in the set. Theorem[3]applies to multi- 
partite systems, and gives a sufficient condition that the 
states can be preserved unchanged. TheoremUthen gives 
a necessary condition for preserving the original Schmidt 
ranks, that the set of density operators ® must 
be mutually orthogonal. Following these results, I then 
proved two theorems that show explicitly a necessary re- 
lationship between the initial and final Schmidt ranks, 
given the parties must always distinguish the state. 

In each case, we discussed how altering restrictions on 
the resources (types of operations and amount of classi- 
cal communication) available to the parties may change 
these results. Various examples were provided illustrat- 
ing this question, including explicit demonstrations of 
the superiority of two-way classical communication over 
protocols where the communication is restricted to be 
in only one direction. In particular, it was shown that 
the sum of Schmidt ranks can exceed the dimension of 
Ti-A ^Ti-B- It may be recalled that Lo and Popescu [3i)J 
have shown that when locally manipulating pure states, 
anything that can be done using two-way communica- 
tion can just as well be done with one-way communica- 
tion alone. For the task of distinguishing a set of states 
and preserving entanglement, we have seen that one-way 
communication may not be sufficient. The reason is that 
we are effectively manipulating a mixed state. This lat- 
ter point was discussed in the previous section, where I 
argued that the question of distinguishing while preserv- 
ing entanglement is closely related to that of purifying 
entanglement from a mixed state. 

Acknowledgments — I would like to thank Yuqing Sun, 
Shengjun Wu, and especially Bob Griffiths for numerous 
enlightening discussions, both specifically on the topic of 
this paper, as well as on quantum information in general. 
I am also very grateful to Li Yu for assistance in counting 
the number of r x r subspaces for the proof of Theorem [1] 
and for sharing the extremely useful set of states and 



APPENDIX A: PROOFS OF THE THEOREMS 
1. Maximum number of states 



Theorem.^ Suppose the parties share a Da ^ Db system 
and using L0CC-K2 are able to distinguish with certainty 
amongst a set of N states while preserving Schmidt rank 
of at least r for every outcome. Then, 

N < [DA/r\ [DB/r\ = N^,^, (Al) 

where \_x\ is the largest integer not greater than x, and 
this upper bound is achievable by LOCC-PO. 
Proof: At some point in the protocol one of the par- 
ties must implement a measurement operator that is less 
than full rank. The reason for this is that in order to 
eliminate any single state in the given set, say |5'j), it 
must be that {A ® B)\^i j) = 0, for some A and B. This 
means either A ot: B must be singular, so has non-trivial 
kernel. If it is Alice who first implements a singular op- 
erator, then that operator divides Alice's Hilbert space 
T-La into two orthogonal parts, its support and its kernel, 
of dimensions Dai and Da — Dai, respectively. If there 
are A^i states that are not excluded, then these states 
must be distinguishable within the remaining Dai x Db 
dimensional space. Furthermore, the N ~ Ni states that 
were excluded lie, from the outset, entirely in the other 
(Da — Dai) x Db dimensional space, so at least this many 
states must be distinguishable in that space. If we de- 
fine a function f(DA, Db) to be the maximum number of 
states perfectly distinguishable while preserving Schmidt 
rank at least r in Da x Db, then Ni < f{DAi,DB) and 
N-Ni< f{DA - Dai,Db). Clearly, 

N = Ni + {N-Ni)< f{DAi,DB) + f{DA ~ Dai,Db). 

The maximum number of states in the original set is then 
bounded above as 

f{DA,DB) < max[f{DAi,DB) + f{DA ~ Dai,Db)], 

(A3) 

and the maximum is taken over all choices of Alice's op- 
erator Ak', in other words, over all ways that she can 
divide her space into two orthogonal pieces. 

We now look for upper bounds on f{DAi,DB) and 
f{DA — Dai,Db) by considering measurements by Bob 
(it could just as well be Alice again) for each of the cases. 
Note that these measurements should be considered as 
completely unrelated protocols; each step in this argu- 
ment involves a "first" measurement (corresponding to 
a singular Kraus operator) in a new protocol aimed at 
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distinguishing a smaller number of states on a smaller 
space. At the second step, we obtain 

}{Da,Db) < max{max[/(i:)Ai,£'si) 

+ f{DAl,DB-DBl)] 
Ak 

+ fiDA~DAi,DB-D'B,)]}, (A4) 
and after many steps, 

n 

f{DA, Db) < max[^ }{DauDbi)], (A5) 
(=1 

with the maximum now taken over operators that se- 
quentially (and by local measurements) divide the origi- 
nal space into n subspaces. 

Since each division represents a successful outcome, 
one of the two subspaces at each step must be at least 
r X r. If any subspace is larger than this, it can be divided 
by a subsequent measurement, so it is valid to continue 
the process until all subspaces are smaller than or equal 
to r X r. Then the maximum in the above equation means 
choosing the best way to divide the space into such sub- 
spaces. Note that /{Dai, Dbi) = 1 if both Dai and Dbi 
are equal to r ( Corollary [7|), and vanishes if either is less 
than r. Hence, the right-hand side of Eq. (|A5|) is equal 
to the maximum number of orthogonal r x r subspaces in 
the original space. We can see A^max is an upper bound 
on this number by assuming otherwise and showing this 
leads to a contradiction. This assumption may be written 
N = Ni + {N-Ni)> [DA/r\ [Db/tI , leading to 

iVi + (7V- A^i) > ilDAi/r\ + l{DA-DAi)/r\)lDB/r\, 

(A6) 

with N and A^i defined above. This implies either A^i > 
[DAi/r\ [DB/r\ or A^ - A^i > [{Da - DAi)/r\ [DB/r\ . 
Following along the argument presented in the preceding 
part of this proof, one eventually arrives at a division for 
which one subspace is r x r and the other is no larger 
than this. We then have that there are either at least 
two states in an r x r subspace (A^i > [r/rj [r/rj — 1), 
or at least one in a subspace smaller than this. This is 
a contradiction, giving us the stated upper bound. Cer- 
tainly, there is no problem fitting Nyna.x r x r subspaces 
into the space, so the bound can be achieved, completing 
the proof. ■ 

2. Schmidt rank sum for one-way protocols 

Theorem [2] // Alice goes first using LOCC-Kl and 
the parties are always able to distinguish and preserve 
Schmidt rank at least r, then Rj < DaYDb/t] . 
Proof: In order for Bob to be able to distinguish with cer- 
tainty following Alice's measurement, the reduced den- 
sity operators of the various possible states remain- 
ing after Alice's outcome must be mutually orthogonal. 



If they must preserve Schmidt rank at least r for each 
outcome, then each of these density operators must have 
rank at least r. These two requirements imply that for 
each of Alice's outcomes, Ak, no more than [D^/rJ of 
the j) can have nonzero probability, or non- vanishing 
(A^Afc (g) Let the eigenstate corresponding to 

nonzero eigenvalue of A\Ak be |aj^). Then, 

{AlAk®lB)\^j)^Y.^l{\al){al\<^^lB)\^,), (A7) 

which vanishes if and only if each term in the sum van- 
ishes. Thus, no more than [_Ds/rJ of the can satisfy 
(|aji^)(a^| (8) lB)\^j) 7^ for any single eigenstate of Al- 
ice's POVM elements. 

From the collection of eigenstates for all these POVM 
elements, choose a (generally non-orthogonal) basis, de- 
noted by {jam)}. Expanding the \^ j) in the dual basis 
{|a,„)}, where ( 

Da 
m— 1 

we see from the arguments of the previous paragraph that 
no more than \D b /r\ of the p.^^ can be non-zero, for any 
fixed TO. 

Now consider the Schmidt ranks. 

Da 

R, = R{\^,))^R{Y,^iUamWm)) 

m—1 

Da 

< Rit,i,\am)\bL))- (A9) 

m—1 

Sum this equation over j to obtain, 

E^^ ^ E (e^(/^™I«™)I^™))) • (Aio) 

j m=l y j J 

Now, i?(^^„|am)|6^„)) = 1 if /x^ ^ and vanishes other- 
wise. Then from the last line of the previous paragraph, 
we have that for each m the quantity in parentheses on 
the right side of this inequality is less than or equal to 
[DB/r\. This yields, 

E^. <DA[DB/r\, (All) 
j 

completing the proof. ■ 
We will now see that the bound in the theorem can be 
reached in all cases not discussed in Section llV Bl (that is, 
whenever that bound does not exceed fminA^max), con- 
sidering first Db > Da- Then we can have a set of 
riB = [-Ds/rJ rank-D^ states, as 

Da-1 

l*j>= E l^)A|fc®«(j-l)r)B, j-l,---,nB, (A12) 

k=Q 
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FIG. 11: Examples illustrating how the bound of Theorem [5] 
can be achieved, (a) The states of Eq. ()A12p when Da < Db, 
with r = 3. Alice's two POVM elements are each rank 3: the 
first annihilates |3)a; the second, |0)a. (b) The states of 
Eq. (|A14[I when Db < Da; each square box is r x r. Note 
that states 10, 11, and 12 are rank-(r + a) with a ^ 0. 



with here indicating addition mod Dg(^j^y Defin- 

ing new quantities ua and a < r through the rela- 
tion Da = riAr + a, we can write Alice's POVM as 

(771 = 1, • • • ,71A - 1), 



E^ = ^\k+ (777 - l)r)A{k + (777 - l)r|, 

fe=0 

r-l 

= ^ afc|fc + {uA - ^)r)A{k + {uA - l)r|, 

fc=0 
r-l 

i?„^+i = J2 ^'k\k + {Da - r))A{k + {Da - r)|,(Al3) 



k=0 



where \i a = Q, — \ and En^+i is to be omitted. When 
a 7^ 0, afe = 1/2 when /c = a • • • , 7' — 1 and a'j, — 1/2 when 
k = 0, •••,r — a — 1; otherwise, these coefficients are 
equal to one. Notice that the last two POVM elements 
have overlapping supports when a 7^ 0, which is why 
some of the coefficients must differ from unity. Whichever 
outcome Alice obtains, Bob can distinguish preserving 
Schmidt rank r. An example of such a set of states is 
given in Fig. [TlTa). 

If DA/nA < Db < Da, we cannot have states with 
Schmidt rank-Z3^, so instead choose 77^(77.^ — 1) states 
having rank r and in addition, ub states having rank 
r + a, with all these states represented as 



1^ 



{n-l)nB+j) 



Y,\k®A {n-l)r)A\k®B (j - 



fe=0 



(A14) 

where 77 = 1 , • • • nA , j = 1 , • • • , 773 , and fc„ = 1 — 1 except 
when n = ua in which case = r + a — 1. Alice 
does the same POVM given in Eq. (jAlSp . and as in the 
previous case for each of her possible outcomes. Bob can 
distinguish preserving r. This set of states is illustrated 
by the example in Fig. [HJb) . 



3. Preserving the original Schmidt ranks 



Theorem |4]// a set of states is perfectly distin- 

guishable by LOCC while preserving Rj, then the density 
operators {pj} form a mutually orthogonal set. 
To prove this theorem, we will use the following lemma, 
in which we refer to a measurement by Alice expressed 
in terms of Kraus operators expanded as 



(A15) 



Then, we can easily prove. 



Lemma 10 Given a complete measurement by Alice, the 
outcomes of which correspond to the operators Ai in 
Eq. liA15\) . and a set of states |C')^ — J2 



Olr, 



A with 



I a: 

the am independent of I and aM 7^ 0. Then {a\j\C,' 
for at least one outcome of Alice 's measurement. 

Proof: This follows from the fact that for a complete 
measurement, we have -^i-^i = Ia, or 



I 



Then we have that 
\ 



a 



M 



I 



41 



(AI6) 



- UM ^ 0. (A17) 



The lemma follows directly. ■ 

We now prove the theorem. 
Proof of Theorem m It will be sufficient to show that the 
parties cannot distinguish a pair of states and preserve 
Rj, j = 1,2, if the density operators pi and p2 are not 
orthogonal. Assuming they are not orthogonal, there are 
two general categories, illustrated in Fig. [SI pertaining to 
the relationship between these two states (Supp(-) means 
support of the indicated operator): 

1. Fig. [5fa) — in a product basis expansion of 



2/l<i>f> 

C Supp(pf 



such that |$2 ) ^ 



[^'2) there is a term |$ 
Supp(pf ) and |<i> 

Fig. [SKb) — if such a term (as in 1. above) is not 
in 1^2) then there must be two terms, |$2^)|^'f ) 
and such that |$^) / Supp(pf) but 

_L Supp(pf ), and _L Supp(pi^) while 



I Supp(pf ). 



Let us consider these two cases separately, beginning with 
the first one. In this case with a convenient choice of 
bases, the states can be written, 

1^2) = (a|0)A+/3|i?i)A)|0)s 

Db-1Da-1 

+ E E fkk'\k')A\k)B, (A18) 

fe=l k'=0 
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where the matrix g has rank a 0, and we have 
l'i'2^)l'J'f ) = a|0)A|0)s (additional such terms do not 
change the conclusion). 

Then, following outcome Ai, written as in Eq. (jAlsp . 
we have 

Ri-l 

M^i) = 9kk'\ai,)A\k)B, 

k,k'=0 
Db-IDa-I 

+ E E hk'\ai,)A\k)B. (A19) 

k=l fe'=0 

Define \C,o)a — a|ao)A + P\a''ji^)A- There are three possi- 
bilities: (a) l^*!) is eliminated by Ai =J> \a\yiA = 0; (b) 
|\E'2) is eliminated by Ai |Co)'4 — 0; or (c) neither is 
eliminated by Ai, so each must continue to have its origi- 
nal Schmidt rank =4> \aQ) a ^ ICo)-4- Since a ^ 0, we 
may conclude from Lemma[TO]there must be at least one I 
such that neither state is eliminated and (oqICq) 0- For 
this outcome, one can again choose a basis with \Q)a de- 
fined as the projection of |Co)a onto Supp(^;/9^A; ) (since 
('^oICo) 7^ ^ Wo) A is in this support, this projection 
is guaranteed to be non-zero). Also choose \Ri)a as the 
projection of |Co)a onto the kernel of Aip^Aj, if this pro- 
jection is non-zero (otherwise /3 = below and the choice 
of is unrestricted, the following conclusion being 

unchanged). This gives 

^/I*i> = E 9kk'\k)A\k)B, 

kM'=0 

A/|*2) = (a|0)A+/3|i?i)A)|0)B 

Db-1 Da-1 

+ E E 'fkk'\k')A\k)B, (A20) 

k=l k'=0 

which has exactly the same form as before Alice's mea- 
surement. By the symmetry between the parties, the 
same conclusion will hold after Bob's subsequent mea- 
surement, and by extension, after they complete an arbi- 
trary number of rounds of measurements. In other words, 
for any LOCC protocol that preserves Rj, there will al- 
ways be an outcome such that they have failed to distin- 
guish between this pair of states. 

For the second case, we can choose bases such that 

1*2) = a\0)A\Ri)B+mi)A\0)B 

Db-1 Da-1 

+ E E fkk'\k')A\k)B, (A21) 
fe=i fe'=i 

with a ^ =/= and after outcome Ai, 

Ai\^2) = aW)A\Ri)B+P\a'njA\0)B 

Db-1 Da-1 

+ E E fkk'\al)A\k)B. (A22) 

k=l k' = l 



First note that if for any single outcome |a'^Jyi is not 
orthogonal to |a'j.,)A yk'=o,---,Ri-i, then we are back to 
the previous case for which we have seen the parties 
cannot distinguish the states and preserve Rj. So we 
may assume this orthogonality in the following. Define 
\C'r,)a = a|ao)A + J2k' fRik'Wk')A- By an argument 
similar to that given for the previous case, there must 
be at least one outcome for which neither state is elimi- 
nated and (oolCfli) 7^ 0. Then choosing a new basis with 
\Ri)a — and \0)a the projection of ICjjj) onto 

S\ipp{Aip^Aj), we have 

Ri-l 

= E 9kk'\k)A\k)B, 
k,k'=0 

Al\^2) = a\0)A\Rl)B+mi)A\0)B 
Db-1 Da-1 

+ E E fkk'\~k')A\k)B. (A23) 
fc=i k'=i 

Once again we see that this has the same form as before 
Alice's measurement, and there is also a symmetry be- 
tween the parties. Hence by the argument used for the 
previous case, we must conclude that they cannot distin- 
guish between these states. This covers all possible cases, 
so the conclusion holds quite generally and the theorem 
follows directly. ■ 



APPENDIX B: PROOF OF NONLOCALITY 
WITHOUT ENTANGLEMENT 

We give here a very simple and transparent proof that 
the nine orthogonal product states of Bennett, et.al. [lH| 



l*l> 


= Wa\1)b, 


1*2) 


= \0)AmB + \l)B), 




= |0)a(|0)b-|1)b), 


1*4) 


= |2)a(|1)b + |2)b), 


1*5) 


= |2)a(|1)b-|2)b), 


1*6) 


= (|1)a + |2)a)|0)b, 


1*7) 


= (|1)a-|2)a)|0)b, 




= mA + \l)A)\2)B, 


1*9) 


= {\0)a-\1)a)\2)b. 



cannot be distinguished by LOCC. The method of proof 
will be to consider general local operations by either 
party, and to show that the only ones that do not destroy 
the mutual orthogonality of the states are proportional 
to unitaries. Then, since unitary operators do not pro- 
vide the parties with any information, and also do not 
alter the relationship between the states, the party who 
goes next can do no better, and so on, no matter how 
many rounds of measurements they make. Hence, they 
are unable to distinguish with certainty. 
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Proof: Due to the symmetry between the parties we 
may suppose Ahce goes first, implementing a completely 
general local operation written as 

2 

A = Y,Wk)A{k\. (B2) 

k=0 

The nine states after this operation become 





|ai)A|l)s, 




1*2) = 


|ao)A(|0)s4 


-|1>b), 


l*3> - 


I«o>a(|0)s- 


|l>s), 


l*4> = 


|a2>A(|l>s 4- 


-|2>b), 


1*5) = 


l«2)A(|l)i3- 


|2)b), 


1*6) = 


i\ai)A + \a2] 


a)\0)b 


1*7) - 


i\ai)A ~ \a2] 


aMb 


l*8> - 


(|ao>A + \ai, 


a)\2)b 


l*9> - 


(|ao>A - |ai, 


a)\2)b 



We require — V^'^j (some of these states may 

vanish identically), since otherwise they cannot distin- 
guish with certainty. Then, considering in turn = 
{1,2}, {1,4}, and {2,4}, we conclude that the states 
\ak)A form a mutually orthogonal set (again, some may 
vanish). Using this fact and considering orthogonality for 
{jii'} — {6, 7} and {8, 9}, we see that the \ak)A must all 
have the same norm; {ak\ak) is independent of k. Hence, 
since we may assume that A does not vanish identically, 
it must be proportional to a unitary operator, which com- 
pletes the proof. ■ 



APPENDIX C: TWO-WAY COMMUNICATION IS 
BETTER THAN ONE-WAY 

For Theorem [5] and Corollary [5] (which, with Da — 
Db= D and Rj = D, state that r < D/N andN < D/r 
for one-way communication, respectively), I now show 
that if they use L0CC-P2, the parties can successfully 
distinguish which state and also, at least for some out- 
comes of their measurements, preserve r ^ D /2 > D/N. 
Let N = 3> D/r = 2, and let I? > 8 be a multiple of 4. 
Take the original states as 

D-l 

1*1) = ^ \k)A\k)B, 
fe=0 
D-1 

|*2> = 5] |fc)A|fc©i?/2)B, 

D-1 

l*3> = ^(-l)'=|fc)A|fc®i^/2)B, (CI) 
fc=0 

where © denotes addition mod D. Alice and Bob can 
each make an orthogonal measurement on their respec- 
tive spaces, with outcomes of rank D/2 represented by 



the projectors 

D/2-1 

Pel = 

fc=0 
D-1 

P0.2 = J2 (C2) 

k=D/2 

One of them, say Alice, tells the other which outcome she 
obtained. If Bob's outcome was the same as Alice's, then 
since Pai PbiI'^'j) = for j = 2, 3 but not for j = 1, he 
then knows that the state was l^^i) and they now share a 
state of Schmidt rank D/2. If their outcomes were both 
1 = 1, for example, then they now have 

D/2-1 

Yl \^')A\k)B. (C3) 

fc=0 

This result is not in contradiction to Theorem [5] nor 
to Corollary[51 however, since if their outcomes were not 
the same, then while the probability that the state was 
I^Pi) now vanishes, the probabilities for the other two 
states are both nonzero. Therefore, in this event one-way 
communication has been insufficient to distinguish the 
state. For example, if Alice obtained outcome 1 while 
Bob obtained 2 (the other case works in a similar way), 
then their system is left in one or the other of 

D/2-1 

1*2) = Y \k)A\k + D/2)B, 

k=0 
D/2-1 

1*3) = E i-l)''\k)A\k + D/2)B. (C4) 

fc=0 

To distinguish between these possibilities, the parties 
must make another round of measurements. Before Alice 
measures, she needs to know from Bob whether or not 
she should. Otherwise, if Bob had obtained the same 
outcome that she obtained, in which case they share the 
state in Eq. (jC3[) . and she goes ahead with the following 
measurement anyway, they will no longer share a state 
of Schmidt rank D/2. This illustrates the results of The- 
orem |5] and Corollary [H] in a somewhat nontrivial way. 

To complete their task. Bob does a measurement with 
projectors {Pb+, Pb-}, where 

Pb± = ^ J2 {\2k + D/2)B±\2k + l + D/2)B) 

X {j^{2k + D/2\± B{2k + l + D/2\). (C5) 

Now, 

1 ^^"^^^ 

Pb±\^2) - 2 ^ (|2fc)^±|2fc+l)^) 

k=Q 

X {\2k + D/2)B±\2k+l + D/2)b), 
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1 ^1^^^ 

Pb±\^z) - 2 ^ (|2fc)^T|2A: + l)^) 

/c=0 

X (|2fc + D/2>B±|2fc+l + D/2)B), 

(C6) 

so Alice can complete the protocol by the measurement 
{Pa+,Pa-}, with 

D/4-1 

Pa±=2 (|2fc)A±|2fc + l)A)G(2fc|± A(2fc + l|). 

fc=0 

(C7) 

If their outcomes are the same (++ or ), the state 

was \^-2) and if different (H — or — h) the state was I'J's). 
In either case, they preserve Schmidt rank of _D/4. 

Another example where two-way communication is 
better than one-way pertains to Corollary [51 which says 
that if > £'/2, a one-way protocol cannot preserve 
entanglement for any single outcome while always dis- 
tinguishing the state. I now give a case where success 
is possible for N > D/2 with a two-way protocol. In 
particular, in _D = 5 the set of A^ = 3 states — |^'o) = 
Efc \k)A\k)B and 1*,) = ^, \k)A\k ® (1 + J = 1, 2 

(with ® again denoting addition mod D) — may be 
distinguished with certainty using a two-way protocol, 
with some outcomes preserving Schmidt rank of 2. One 
such successful outcome is when the parties both obtain 
Pal = |0)a(0| -I- |1)q(1|, a = a, B va their measurements, 
identifying |5'o) as the state. I leave the remainder of 
this protocol (which is not unique) as an exercise for the 
reader. (Here, as in the previous example, communica- 
tion must go both ways after their initial measurements 
in order that they know whether or not to continue. Oth- 
erwise, they risk destroying entanglement in the case they 
preserved it, because they do not know the result of the 
other's measurement.) 



APPENDIX D: PRESERVING \D/N\ WITH 
LOCC-KO 

Here, I will prove two theorems for distinguishing and 
preserving entanglement by LOCC-KO, with the discus- 
sion restricted to sets of Schmidt rank-_D states va DxD. 
According to Theorem [SJ the largest Schmidt rank they 
can preserve is D/N. Let us begin with a theorem con- 
cerning the case when D/N is an integer. 

Theorem 11 Suppose D/N is an integer, and the par- 
ties are restricted to LOCC-KO. If there is a protocol in 
which they are always able to distinguish a set of Schmidt 
rank-D states on a D x D system, and can preserve the 
maximum Schmidt rank of D/N for at least one outcome, 
then they preserve D/N for all their outcomes, and they 
can do so using orthogonal measurements (LOCC-PO). 



Proof: Consider the single outcome that by assumption 
preserves D/N, for which both Alice's and Bob's mea- 
surement operators (say. Am, Bn) must have rank D/N. 
When Alice gets A,„, each of Bob's reduced density op- 
erators, pfm: will have rank D/N. Then to distinguish 
with certainty these density operators must all be mutu- 
ally orthogonal (this is why neither Am or Bn can have 
rank greater than D/N). Given that D/N is an inte- 
ger, this uniquely determines an orthogonal measurement 
Bob may use, with measurement operators having sup- 
port identical to those of these density operators, so each 
has rank D/N. Hence, when Alice gets A,„, they can 
preserve D/N for any of Bob's outcomes, and he may 
just as well do an orthogonal measurement. Recognizing 
that Alice and Bob play completely equivalent roles, this 
argument may be turned around starting with Bob's out- 
come Bn determining an orthogonal measurement Alice 
can choose, thus showing that they both may choose or- 
thogonal measurements for which each of their combined 
outcomes preserves D/N. ■ 

It should be noted that the starting assumption that 
they are always able to distinguish the state is crucial. 
The restricted assumption, that only the single outcome 
{Am,Bn} is known to distinguish the state, certainly 
does not lead to the conclusion that the parties can dis- 
tinguish for all outcomes. This does not even follow from 
knowing that the single outcome on Alice's side Am is 
known to allow Bob to distinguish for every outcome of 
a measurement he can make, a fact that can be seen 
from the following discussion. As argued in the above 
proof. Am determines a set of reduced density operators 
on Bob's side, which determine the allowed supports for 
Bob's measurement operators, which in turn determine 
a set of allowed supports for Alice's measurement oper- 
ators. There is certainly no guarantee that these sets 
of supports, one for each of Bob's outcomes, will all be 
identical. If they are not, Alice will be unable to choose 
a measurement that will distinguish in all cases, and the 
parties must fail for at least some of their outcomes. 

What if D/N is not an integer, but it is known they 
can always distinguish and that one outcome preserves 
Schmidt rank of \_D/N\1 In this case, the extra dimen- 
sions allow for flexibility in the choice of measurements, 
and the conclusion of the previous theorem no longer 
holds. Indeed, the two states {D — h, N — 2) 

4 

1*1) = Y.\k)A\k)B, 

k=0 

1*2) = |0)a|2)b + |1)a|3)b 

+ |2)a|0)b + |3)a|4)b + |4)a|1)s, (Dl) 

are distinguished for all outcomes by the orthogonal mea- 
surements 

Pal = |0)„(0| + |1)„(1|, 

Pa2 = |2),(2| + |3)„(3|, 

Pas = \^)a{M, (D2) 
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but only some outcomes preserve r = 2 (outcome 1 for 
Alice, 1 for Bob, for example) whereas others preserve 
r = 1 (2 for Alice, 1 for Bob). While in this example 
they do not always preserve entanglement, one can easily 
think up other such examples with N > S where they do. 

The following result, which is clearly weaker than The- 
orem [11] applies to the case when D/N is not an integer. 

Theorem 12 Suppose D/N is not an integer, and the 
parties are restricted to LOCC-KO. Then in order for 
them to distinguish a set of Schmidt rank-D states on a 
DxD system and preserve the maximum Schmidt rank of 
lD/N\ for every outcome of their measurement, it must 
be that D = {N + n)\_D/N\, with n a positive integer. 
That is, it must be possible to divide the parties' spaces 
into subspaces all having dimension equal to the maxi- 
mum achievable Schmidt rank, a task for which they can 
use LOCC-PO. 

Proof: First note that if all outcomes preserve \_D/N\, 
each of the parties' measurement operators must have 
rank \ D /N\ (this follows from arguments similar to those 
in the preceding proof). I will show below that any pair 
of measurement operators for either one of the parties 
must have supports that are either orthogonal or iden- 
tical. Then, the measurements divide their spaces into 
orthogonal subspaces each of dimension \_D/N\, and the 
theorem follows immediately. 

The proof is by contradiction. Hence, suppose on the 
contrary Ai and A2 have supports of dimension \_D/N\ 
that are neither orthogonal nor identical to each other. 
Each Am (to = 1, 2) determines a set of iV reduced den- 
sity operators on Bob's space, p^, one for each state 
jvl/j). Each of these density operators has rank \D /N\ 
or 0. In order to distinguish and preserve \_D/N\, Bob 
must choose each of his measurement operators to have 
support containing the support of one of the ^^-'^ 
orthogonal to the others (for each to) . Consider the den- 
sity operators pj^ {p%) for Alice's first two outcomes and 
some fixed state \ and Bob's corresponding measure- 
ment operators Bi (-82)- The support of the latter must 
be chosen to contain the range of MjAi {AIJA2) [Mj 
is the matrix corresponding to in Eq. (I2ip ). Given 
l^'j) is rank-D, then Mj is non-singular, and the fact 
that the supports of Ai and A2 are neither orthogonal 
nor identical implies the same fact about the supports 
of Bi and i?2. In particular, the range of MjAi, which 



is the support of pj^, intersects both the support and 
the kernel of B2- This implies that when the state is 
l^'j) and Alice measures Ai, there is a non-zero prob- 
ability that Bob will measure B2 (since the support of 
B2 is not orthogonal to Pji)- When this occurs, Bob's 
reduced density operator becomes pj^ = B2PJ1BI (ig- 
noring unimportant normalization). But given that the 
support of pj^ is not orthogonal to the kernel of B2, the 
rank of pj^ will be strictly less than that of p^^. That 
is, the Schmidt rank of their residual shared state, which 
is equal to the rank of pj^, is strictly less than [D/iVj, 
the rank of pj-^ . This contradicts the conditions of the 
theorem, implying that Ai and A2 must have supports 
that are either orthogonal or identical to each other. ■ 
Theorem [12] only gives a necessary condition, so it says 
nothing as to whether or not the ability to divide into 
equal size subspaces of dimension ID/N\ is sufficient for 
the parties to accomplish this task. The following exam- 
ple shows that when D — {N + n)lD/N\, there exists 
at least one set of N states that can always be distin- 
guished by LOCC-PO preserving the maximal possible 
Schmidt rank. The states are 

D 

= EI^)-4I^®L^/^J(j-i))b, j = (D3) 

fc=0 

with © again denoting addition mod D. The parties do 
the orthogonal measurements, 

[D/N\l-1 

Pc.i= \k)a{k\, I = !,■■■, N + n. (D4) 

k=lD/N\{l-l) 

Then, Pai Psi'l'^j) vanishes except [Slj for the single 
state j satisfying I' = [j + I — 1) mod {N + n), and since 
1 < i < A^, then for fixed I the possible set of values for 
I' aiel,l + l,---,{l + N-l) mod {N + n). When it does 
not vanish, it is equal to 

ID/N\{1+1)-1 

Pai®Pbv\^j)^ J2 \k)A\k(BlD/N\{j~l))B, 

k=[D/N\l 

(D5) 

which is of Schmidt rank ID/N\. Thus, all outcomes 
with non-zero probability distinguish, preserving \_D/N\ . 
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